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A  Detailed  Description  of  the  VLSI-PLM  Instruction  Set 


Bruce  K.  Holmer 
Aquarius  Group 
Computer  Science  Division 
University  of  California,  Berkeley 


1.  Introduction 

This  document  describes  the  VLSI-PLM  instruction  set  and  includes  small  programs  to  test  details  of 
its  implementation. 

Given  below  is  a  set  of  C  code  that  describes  the  actions  of  each  VLSI-PLM  instruction.  It  is  essen¬ 
tially  a  level  one  simulator  presented  instruction  by  instruction.  I  assume  that  the  reader  has  on  hand  a 
copy  of  The  Berkeley  PLM  Instruction  Set:  An  Instruction  Set  for  Prolog  [FaDo].  Warren’s  original 
definition  of  the  WAM  [War]  and  Dobry’s  thesis  Pob]  are  also  very  helpful.  Material  in  these  documents 
are  not  duplicated  here  unless  needed  for  completeness  or  to  point  out  corrections. 

The  code  is  not  a  direct  transliteration  of  the  VLSI-PLM  microcode.  I  have  rearranged  operations  if 
the  result  is  more  clear.  In  the  use  of  C,  I  have  restricted  myself  to  arithmetic  and  logic  operations,  assign¬ 
ments,  if  and  switch  statements,  jumps,  and  function  calls.  These  roughly  correspond  to  the  operations 
available  on  most  microarchitectures,  and  therefore  the  elemental  operations  are  less  likely  to  be  hidden  by 
C  constructs. 

2.  Fundamentals 

2.1.  Registers  and  Memory  Layout 

The  registers  and  state  bits  of  the  VLSI-PLM  are: 


P 

CP 

H 

B 

E 

TR 

HB 

N 

AO  through  A7 

CUT 

MODE 

PDL 

H2 


program  counter  (30-bit  byte  address) 

continuation  pointer  (return  address) 

heap  pointer 

choice  point  pointer 

environment  pointer 

trail  pointer 

heap  pointer  on  backtracking 
number  of  permanent  variables 
eight  argument  registers 
cut  bit 

mode  bit  (READ  or  WRITE) 
unification  push  down  list  pointer 
global  heap  pointer 


In  [FaDo]  the  argument  registers  where  specified  with  either  Ai  or  Xi .  In  this  document,  I  will  use  only 
Ai  and  number  the  registers  starting  from  0  ([FaDo]  and  the  PLM  compiler  both  start  the  numbering  from 
1).  Similarly,  Yi  addressing  starts  from  zero  (rather  than  one  as  in  the  compiler).  I  have  chosen  this  con¬ 
vention  since  it  matches  the  actual  value  used  in  the  argument  byte  of  the  machine  instruction. 
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The  program  counter  (P )  is  not  contained  in  the  VLSI-PLM,  but  instead  is  kept  in  the  external 
instruction  prefetch  unit.  The  VLSI-PLM  can  request  a  new  value  for  P  by  either  sending  the  entire  new 
30-bit  value  (as  in  proceed)  or  by  sending  an  8-bit  offset  (as  in  switch_on_tenn).  The  current 
prefetch  unit  treats  offsets  as  positive  only  (as  shown  in  the  pseudo  code),  however,  with  a  change  to  the 
prefetch  unit  (and  assembler)  dtis  could  be  changed  to  signed  offsets. 

Choice  point  structure : 

< -  B 

TR 
P 

CP 
E 

A7 
A6 
A5 
A4 
A3 
A 2 
A1 
AO 
N 
H 
B 


Structure  of  environment : 

Yn 

Y1 

YO  < -  E 

N 

CP 

COT  I  B 
E 


Structure  of  trail: 


VALOE 

NONVAP.  |  ADDR 
VAR  I  X 
VAR  I  Y 

< - TR 

Unification  of  nested  structures  requires  a  push  down  list  On  the  VLSI-PLM  this  is  supplied  as  an 
on-chip  circular  buffer  (with  automatic  spill  to  memory  on  overflow).  This  mechanism  is  not  included  in 
die  following  pseudo-code  since  it  would  overly  complicate  the  unify  algorithm.  Instead,  I  assume  the 
existence  of  two  arrays,  PDL1  and  PDLr ,  and  an  index  pointer,  FDL.  Since  a  very  small  fraction  of  the 
total  execution  time  is  spent  in  the  general  unify  routine,  these  arrays  could  be  placed  on  top  of  the 


I 

I  addraas/value  pair  from  setarg/3 
normal  trail  entries 


(points  at  one  position  beyond  end  of  choice  pt) 


|  higher  addresses 
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environment  or  heap  for  the  duration  of  the  unify  routine  with  little  loss  of  performance. 

In  the  pseudo  code,  the  comparison  ADDRESS  Yi  is  often  used.  This  does  not  refer  to  real 
registers  or  state  bits,  but  reflects  the  decoding  of  the  opcode  of  the  current  instruction.  If  the  opcode  indi¬ 
cates  that  the  addressing  mode  refers  to  a  register  (xi  ),  then  this  test  would  be  false.  Otherwise,  if  the 
addressing  mode  refers  to  a  permanent  variable  (Yi  ),  then  the  test  succeeds. 

2.2.  Cdr  coding 

The  primary  rule  for  handling  cdr  bits  is:  the  cdr  bit  is  associated  with  the  memory  location,  not  the 
data  value.  Cdr  bits  (that  have  meaning)  can  appear  only  on  the  heap  and  trail,  and  can  only  be  created  by 
the  unify_nil  and  unify_cdr  instructions.  The  cdr  bit  of  a  stack  variable  has  no  meaning,  since 
lists  and  structures  are  not  built  on  the  stack.  Cdr  bits  can  be  removed  from  a  given  location  only  when  that 
location  is  reclaimed  on  backtracking.  Binding  and  detrailing  do  not  change  the  value  of  the  cdr  bit.  Dur¬ 
ing  unification  the  cdr  bit  does  not  affect  the  match  between  two  constants. 

The  mixture  of  cdr  coding  and  variable  dereferencing  gives  rise  a  subtle  point  The  value  (and  tag) 
of  a  variable  is  determined  by  the  data  at  the  end  of  its  dereference  chain.  However,  the  cdr  bit  of  the  vari¬ 
able  itself  must  be  used  (rather  than  the  cdr  bit  of  the  dereferenced  value). 

The  Xenologic  XI  microcode  takes  the  position  that  the  cdr  bit  of  data  in  an  A  register  has  no  mean¬ 
ing,  and  therefore  is  always  cleared.  This  was  not  done  in  the  VLSI-PLM,  since  the  non-WAM  instruc¬ 
tions  allow  the  cdr  bit  to  be  manipulated  by  assemblv  code. 

23.  Instruction  Formats 

Although  Dobry’s  thesis  contains  explicit  information  about  instruction  formats,  a  brief  summary  of 
instruction  opcodes  and  arguments  is  given  here.  All  opcodes  are  a  single  byte.  Instructions  have  up  to 
three  arguments,  the  first  is  either  one  or  four  bytes  and  the  next  two  are  always  one  byte. 


Instruction 

Opcode  (hex) 

allocate 

00 

deallocate 

01 

proceed 

80 

cut 

trust_me_else 

fail 

82 

unify  _nil 

02 

nop 

04 

halt 

06 

reset 

81 

Instructions  with  no  arguments 


March  1, 1989 


-4- 


Instruction 

Opcode 

Argl 

switch_on_constant 

dO 

table  address 

hash  mask 

switch_on_structure 

dl 

try 

45 

destination  address 

retry 

46 

trust 

47 

try_me_else 

41 

continuation  address 

retry_me_else 

42 

cutd 

49 

continuation  address 

44 

destination  address 

4c 

call 

54 

destination  address 

new  N 

get_constant 

50 

constant 

A  register 

put_constant 

52 

functor 

A  register 

unify_constant 

40 

constant 

Instructions  with  four-byte  Argl 


Escape 

Opcode 

Argl 

External  (arity  0) 

4a 

00000100 

External  (arity  1) 

4a 

00000101 

External  (arity  2) 

4a 

00000102 

External  (arity  3-7) 

4a 

00000103 

Additional  regs  dumped 
External  (arity  0) 

4a 

00000180 

External  (arity  1) 

4a 

00000181 

External  (arity  2) 

4a 

00000182 

External  (arity  3-7) 

4a 

00000183 

X7  to  H2 

4a 

00000060 

H2  to  X7 

4a 

00000061 

X7  to  H 

4a 

00000062 

H  to  X7 

4a 

00000063 

X7  to  S 

4a 

00000064 

S  to  X7 

4a 

00000065 

X7  toB 

4a 

00000066 

B  to  X7 

4a 

00000067 

X7  to  E 

4a 

00000068 

EtoX7 

4a 

00000069 

X7  toTR 

4a 

0000006a 

TR  to  X7 

4a 

0000006b 

Escape  instructions  (Argl  is  four  bytes) 
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Instruction 

!  Opcode 

Argl 

Arg2 

Arg3 

switcb_on_term 

b4 

constant  offset 

list  offset 

structure  offset 

get_nil 

IMSilSf 

A  register 

get_list 

ii 

put_nil 

12 

put_list 

13 

get_variable 

20 

A  register 

A  register 

get_value 

21 

put_variable 

22 

put_value 

23 

get_variable 

28 

Y  index 

A  register 

get_value 

29 

put_variable 

2a 

put_value 

2b 

put_unsafe_value 

2c 

Y  index 

A  register  j 

umfy_variable 

15 

A  register 

unify_value 

17 

unify_cdr 

14 

! 

unify_variable 

■a 

Y  index 

unify_value 

■■ 

unify_cdr 

unify_void 

18 

count 

deref 

le 

A  register 

add 

24 

A  register  source 

A  register  destination 

sub 

25 

and 

26 

or 

27 

COT 

2d 

mult 

2e 

memread 

34 

A  reg  (addr  base) 

A  register  destination 

positive  offset 

mem  write 

35 

A  reg  (addr  base) 

A  register  source 

positive  offset 

coderead 

16 

A  reg  (address) 

codewrite 

19 

A  reg  (address) 

jeq 

b5 

A  register 

A  register 

positive  branch  offset 

jh 

b6 

jle 

b7 

jumpxn 

90 

A  register 

| 

loadn 

_i! _ 

new  N 

| 

Instructions  with  one -byte  Argl 


3.  Details  of  Pseudo  Code 

Constants  and  user  visible  registers  are  given  names  consisting  of  all  capital  letters.  The  constants 
VAR,  CON,  LST,  and  STR  represent  the  tag  values  for  variables,  constants,  lists,  and  structures,  respec¬ 
tively.  Temporary  variables  are  usually  specified  by  lower  case,  e.g.  tO,  valtO,  etc.  These  tem¬ 
poraries  do  not  necessarily  correspond  to  scratch  registers  in  the  microarchitecture. 

In  the  VLS1-PLM,  logic  operations  and  equality  tests  are  32-bit  Arithmetic  operations  and  inequal¬ 
ity  tests  operate  on  only  the  value  field  (28  bits).  The  pseudo  code  below  adopts  the  usual  meaning  of  these 
operators  in  C  (working  on  32-bit  signed  integers).  Explicit  use  of  value  along  with  an  operation  or 
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comparison  signifies  the  fact  that  that  operation  or  comparison  is  28-bit  (thus  the  comparisons  are 
unsigned). 

There  are  several  basic  operations  that  often  occur.  I  define  them  here  in  terms  of  arithmetic  and 
logic  operations,  but  the  microarchitecture  directly  supports  them. 


tag(tO) 

{ 

return  (to  »  30)  £  3; 

} 

cdr  (tO) 

{ 

return  (tO  »  29)  £  1; 

} 


/*  return  bits  <31:30>  */ 


/*  return  bit  <29>  */ 


value (tO) 

{ 

return  to  £  OxOfffffff; 

} 


/*  return  bits  <27:0>  */ 


construct (tag,  cdr,  val) 

{  /*  gc  bit  is  cleared  */ 

tO  «  (tag  «  30)  £  OxcOOOOOOO; 
tl  -  (cdr  «  29)  £  0x20000000; 
t2  -  val  £  OxOfffffff; 

return  (to  |  tl  I  t2) ; 

1 


In  addition,  the  function  memread(tO)  performs  a  memory  read  from  the  address  given  by 
value  (tO)  ,  and  returns  the  entire  32  bits  at  this  address.  The  function  memwrite  (tO,  tl)  per¬ 
forms  a  memory  write  to  the  address  given  by  value  (tO)  ,  storing  the  entire  32  bits  of  tl  at  this 
address.  The  functions  coderead  and  codewrite  are  similar,  but  read  and  write  to  the  code  address 
space  (which  is  separate  from  die  data  address  space). 

4.  Basic  Operations 

4.1.  Dereference 

This  dereference  routine  assumes  that  tO  is  a  variable.  The  VLSI-PLM  has  a  conditional  micro- 
subroutine  branch  based  on  the  tag  value,  and  so  this  is  the  natural  thing  to  do.  An  alternative  is  to  test  1 0 
in  dereference  and  return  immediately  if  the  1 0  is  not  a  variable. 

One  must  beware  of  the  case  when  the  last  element  of  a  dereference  rhnin  is  an  unbound  variable, 
and  it  and  the  previous  link  (the  variable  pointing  to  the  unbound  variable)  have  differing  cdr  bits.  The 
code  given  here  is  conservative  and  will  dereference  an  extra  time  in  this  case. 
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dereference (tO) 
{ 

LI: 


/*  assumes  that  on  entry:  tag(tO) 


tl  «  memread (tO)  ; 

if  (tO  “  tl  ||  tag(tl)  !«  VAR)  return  tl; 
tO  -  tl; 
goto  LI; 


VAR  */ 


42.  Trail 

It  is  important  that  the  entire  32  bits  of  the  variable  be  written  to  the  trail  stack,  since  the  detrail  rou¬ 
tine  (in  fail)  restores  the  value  by  a  32-bit  transfer. 

Although  the  trail  grows  toward  low  memory,  the  trail  pointer,  TR ,  points  at  the  next  available  loca¬ 
tion  (rather  than  the  top  entry  of  the  stack). 


trail (tO) 

1 

/*  assumes  that  tO  is  the  unbound  variable  to  be  trailed  */ 
valtO  «  value (tO) ; 

if  ( (valtO  <  value (B)  ££  valtO  >  value  (H) )  |  |  /*  unbnd  var  on  stack  */ 

valtO  <  value (HB) )  {  /*  unbnd  var  on  heap  *7 

memwrite (TR,  tO ) ; 

TR—  ; 


) 


} 


43.  Fail 

The  VLSI-PLM  does  not  have  variable  sized  choice  points,  so  all  of  the  argument  registers  are 
saved,  even  if  it  is  not  necessary.  This  makes  garbage  collection  harder,  since  one  does  not  know  which 
values  are  valid  starting  points  for  marking.  A  possible  solution  is  to  store  the  number  of  valid  argument 
registers  in  X7 ’s  position  (X7  is  used  by  the  compiler  only  as  a  temporary). 

The  detrailing  done  in  fail  must  be  done  by  popping  the  trail  stack.  This  is  important  because  several 
setarg  entries  may  modify  the  same  memory  location,  and  the  oldest  trail  entry  should  be  restored.  The 
setarg  trail  entry  consists  of  an  address/value  pair.  The  address  must  have  a  non-variable  tag  to  dif¬ 
ferentiate  it  from  a  normal  (unbound  variable)  entry. 
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fail () 

{ 


LI: 


} 


PDL  -  0; 

if  (value (B)  “  value (STKbase) )  { 

goalf ail ( ) ; 
return; 

} 

n«wTR  *=  memread (B  -  1); 

P  m  memread(B  -  2); 

CP  «  memread (B  -  3); 

E  «  memread (B  -  4); 

A[7]  ■  memread (B  -5); 

A [6]  “  memread (B  -  6); 

A[5]  ■=  memread(B  -7); 

A [4]  ■  memread (B  -  8)  ; 

A[3]  **  memread  (B  -9); 

A [2]  =  memread (B  -  10); 

A[l]  *  memread (B  -  11); 

A[0]  «  memread (B  -  12); 

N  «  memread (B  -  13); 

H  ■  memread (B  -  14); 

if  (value (newTR)  value (TR) )  return; 

TR++ ; 

tO  «  memread (TR) ; 
if  (tag(tO)  —  VAR)  { 

memwrite (to,  tO);  /»  normal  trail  entry  */ 

}  else  { 

TR++;  /*  2  word  entry  for  setarg  */ 

tl  m  memread (TR); 
memwrite (tO,  tl); 

} 

goto  LI; 


5.  Indexing  Instructions 
5.1.  Switch  on  Term 

This  instruction  takes  three  8-bit  arguments  that  give  the  (positive  only)  brand]  displacement  for  the 
constant,  list,  and  structure  cases.  The  variable  case  continues  execution  with  the  next  statement  The 
maximum  value  for  die  displacement  (2S5)  indicates  failure. 
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switch_on_tern\ ( ) 

{ 

tO  =  A  [0 ]  ; 

if  (tag(tO)  “  VAR)  tO  *  dereference (tO) ; 

switch  (tag(tO))  { 
case  VAR: 

return; 
case  CON: 

tl  “  argl; 
break; 
case  LST: 

tl  *  arg2; 
break; 
case  STR: 

tl  «  arg3; 
break; 

) 

if  (tl  :  Oxff)  { 
fail ( ) ; 
return; 

} 

P  «  P  +  tl; 


5.2.  Switch  on  Constant  and  Structure 

Both  of  these  instructions  take  a  28-bit  code  space  word  address  (argl )  which  specifies  the  beginning 
location  of  the  hash  table,  and  an  8-bit  mask  (arg2)  which  is  anded  with  the  constant  or  functor  value  as  the 
hash  function. 


switch_on_constant ( ) 

< 

tO  «  A  [0  ]  ; 

if  (tag(tO)  VAR)  tO  «  dereference (tO) ; 
switch (tO )  ; 

} 


switch_on_structure () 

1 

tO  -  A [0] ; 

if  (tag(tO)  VAR)  tO  “  dereference (tO) ; 

func  •  memread (tO) ;  /*  functor  of  structure  */ 

switch (func) ; 


These  switch  instructions  utilize  open  addressing  hashing  with  linear  probing.  The  functor  or  con¬ 
stant  value  is  multiplied  by  two  so  that  consecutive  integers  will  map  to  consecutive  table  entries.  The  hash 
function  is  given  by  the  functor  or  constant  value  anded  with  the  mask  (arg2). 
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The  hash  table  consists  of  two  word  entries,  a  value  and  a  code  address.  If  the  value  entry  matches 
the  value  of  the  constant  or  functor,  then  execution  proceeds  at  the  code  address.  The  cdr  bit  of  the  value 
entry  is  used  to  denote  unused  table  locations  and  a  valid  entry  just  before  an  unused  location  The  value 
of  an  empty  entry  must  be  unique  (different  from  all  constants  and  functors)  since  probing  can  begin  at  an 
empty  entry.  Failure  occurs  when  all  the  table  entries  have  been  unsuccessfully  checked  or  if  after  looping 
to  the  top  of  the  table,  the  cdr  bit  of  the  value  is  set. 


/ 


switch (tO ) 

{ 

tl  -  tO  «  1; 
tl  «  tl  i  arg2; 
start  «=  argl  +  tl; 
ptr  «  start; 

LI; 

t2  «  coderead (ptr ) ; 
if  (value (t2)  “  value (tO) )  { 

P  •=  coderead  (ptr  +  1)  ; 
return; 

5 

ptr  “  ptr  +2; 
if  (cdr(t2)  “  0)  goto  LI; 


/*  pointer  into  hash  table  */ 


ptr  «  argl;  /*  start  over  at  beginning  of  table  */ 

L2 : 

if  (ptr  >«  start)  { 
fail () ; 
return; 

1 

t2  «  coderead (ptr) ; 
if  (value (t2)  —  value (tO))  { 

P  «  coderead (ptr  +  1); 
return; 

1 

ptr  «  ptr  +  2; 
if  (cdr(t2)  0)  goto  L2; 


fail()  ; 
return; 


There  are  several  improvements  that  can  be  made  with  tins  instruction.  The  anding  of  arg2  should  be 
done  before  the  left  shift,  allowing  tables  with  256  entries.  Also,  instead  of  failing  when  a  hash  lookup  is 
unsuccessful,  execution  should  proceed  to  the  next  instruction,  allowing  several  switch  instructions  to  be 
placed  one  after  another  (this  is  what  is  dooe  in  the  XI).  This  is  useful  for  hash  tables  with  more  than  128 
entries.  In  addition,  eliminating  the  code  which  probes  from  the  beginning  of  the  table  would  make 
modification  of  the  hash  table  easier  during  assert  and  retract  Finally,  the  cdr  bit  should  be  set  only  for 
empty  entries,  eliminating  the  need  for  unique  values  in  empty  entries.  These  changes  are  summarized 
below  with  new  pseudo  code. 
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switch (tO)  /*  not  implemented  */ 

{ 

tl  ■  tO  J  arg2; 
tl  -  tl  «  1; 

ptr  ■=  argl  +  tl;  /*  pointer  into  hash  table  */ 

LI: 

t2  *=  coderead (ptr) ; 

if  (cdr(t2)  “  1)  goto  L2;  /*  empty  if  cdr  set  */ 

if  (value  (t2)  <“  value (tO))  { 

P  «  coderead (ptr  +  1); 
return; 


) 

ptr  *  ptr  +  2; 
goto  LI; 

L2 : 

return; 


} 


/*  if  no  match,  continue  with  instruction  */ 
/*  following  the  switch  */ 


6.  Procedure  Control  Instructions 
6.1.  Try 
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try  () 

< 

CUT  -  1; 

if  (value (E)  <  value (B) )  { 
tO  «=  B; 

)  else  { 

tO  -  E  +  N; 

) 

memwrite (tO,  B)  ; 

memwrite (tO  +  1,  H)  ; 

memwrite (tO  +  2,  N) ; 

memwrite (tO  +  3,  A[0]); 

memwrite (tO  +  4,  A[l]); 

memwrite (to  +  5,  A[2]); 

memwrite (tO  +  6,  A[3]); 

memwrite (tO  +  7,  A  [4]); 

memwrite (tO  +  8,  A[5]); 

memwrite (tO  +  9,  A [6]); 

memwrite (tO  +  10,  A[7]); 
memwrite (to  +  11,  £); 
memwrite (tO  +  12,  CP); 
memwrite (tO  +  13,  P); 
memwrite (tO  +  14,  TR) ; 

B  -  tO  +  15; 

HB  -  H; 

P  «  argl; 

) 


6 J2.  Retry 

retry  () 

( 

CUT  -  1; 

memwrite (B  -  2,  P)  ; 
P  “  argl; 

} 


6.3.  Trust 

trust  () 

{ 

CUT  «  0; 
tO  *  B; 

B  ■  memread(tO  -  15); 

HB  -  memread (tO  -  14);  /*  not  the  optimal  value  for  HB  */ 

P  -  argl; 

) 
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The  HB  register  is  not  updated  optimally  in  the  PLM.  Whenever  choice  points  are  discarded,  the 
HB  register  is  restored  first,  and  then  the  choice  point  is  thrown  away.  What  should  happen  is  that  the 
choice  point  should  be  discarded  first,  and  then  the  HB  register  loaded  from  the  newly  activated  choice 
point  on  top  of  the  stack. 

The  current  HB  register  updating  scheme  will  give  correct  operation,  but  will  trail  more  often  than 
necessary.  Fixing  the  level  1  simulator  reduced  the  number  of  trails  on  the  quicksort  benchmark  by  more 

than  half 

If  the  last  choice  point  cm  the  stack  is  discarded,  and  then  the  PLM  attempts  an  HB  register  update, 
then  the  HB  register  gets  loaded  with  garbage.  This  is  because  the  HB  register  is  always  updated  from 
the  H  field  of  the  current  choice  point;  if  their  are  no  choice  points  on  the  stack  then  the  value  obtained  in 
this  way  is  obviously  not  meaningful. 

However,  this  doesn’t  affect  the  operation  of  the  PLM,  provided  the  address  generated  when  the 
PLM  attempts  to  update  the  HB  from  the  nonexistent  choice  point  is  accepted  by  the  memory  system. 
Consider  for  a  moment  why  this  must  be  so.  Only  two  incorrect  actions  are  possible  as  a  result  of  an 
incorrect  HB  value:  either  a  value  is  trailed  that  should  not  be,  or  a  value  is  not  trailed  that  should  be.  In 
either  case,  the  undesirable  effects  will  not  be  felt  until  failure  occurs,  and  the  trail  is  unwound.  However, 
the  trail  section  built  with  the  garbage  HB  value  was  placed  on  the  trail  stack  when  there  was  no  choice 
point,  so  when  failure  occurs  the  PLM  reports  top  level  failure  and  quits. 

This  problem  didn’t  arise  before  because  the  HE  register  was  updated  from  the  last  choice  point  dis¬ 
carded  from  the  stack,  thus  avoiding  a  special  check  for  the  top  of  the  stack  every  time  a  choice  point  was 
discarded  to  prevent  HB  from  getting  a  garbage  value.  (The  above  comments  are  originally  from  Bany 
Fagin). 

The  problem  with  loading  HB  with  garbage  can  be  eliminated  by  initializing  the  choice  point  stack 
with  a  “sentinel”  choice  point.  This  choice  point  would  contain  the  initial  values  of  all  the  stack  pointers 
and  the  code  address  of  the  routine  for  goal  failure.  If  this  scheme  were  used,  the  check  B  — 
STKbase  in  fail  could  be  eliminated. 

6.4.  Try  me  Else 
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try_me_else () 

{ 

CUT  ■=  1; 

if  (value (E)  <  value (B) )  ( 
tO  =  B; 

}  else  { 

tO  -  E  +  N; 

} 


memwrite (to, 

B)  ; 

memwrite (tO 

+ 

1,  H); 

memwrite (tO 

4 

2,  N); 

memwrite (tO 

4 

3,  A [ 0 ] )  ; 

memwrite (tO 

4 

4,  A [ 1 ] )  ; 

memwrite (tO 

4 

5,  A [2 ] )  ; 

memwrite (tO 

4 

6,  A[3] ) ; 

memwrite (tO 

4 

1,  A [4 )  )  ; 

memwrite (tO 

4 

6,  A [5] )  ; 

memwrite (tO 

4 

9,  A  [  6]  )  ; 

memwrite (tO 

4 

10,  API); 

memwrite (tO 

4 

11,  E); 

memwrite (tO 

4 

12,  CP); 

memwrite (tO 

4 

13,  argl); 

memwrite (to 

4 

14,  TR) ; 

B  -  to  +  15; 
HB  «  H; 


6i.  Retry  me  Else 

retry_me_else ( ) 

{ 

CUT  «  1; 

memwrite (B  -  2,  argl); 


6.6.  Trust  me  Else 

trust_me_else () 

{ 

CUT  -  0; 
tO  -  B; 

B  *  memread (tO  -  15); 
HB  “  memread (tO  -  14); 


/*  not  the  optimal  value  for  HB  */ 
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6.7.  Cut 

cut  () 

{ 

/*  must  get  rid  of  cdr  (cut)  bit  */ 

/*  not  the  optimal  value  for  HB  */ 


6.8.  Cutd 

cutd ( ) 

{ 

LI: 

tO  =  B; 

B  «  memread (tO  -  15)  ; 
tl  =  memread (tO  -  2); 

if  (tl  ! *=  argl)  goto  LI; 
HB  «  memread(tO  -  14); 


tO  •=  memread  (E  -  3)  ; 

B  “  value (tO ) ; 
if  (cdr (tO )  “1)  ( 

B  «  memread (tO  -  15); 
HB  “  memread (tO  -  14); 


7.  Clause  Control  Instructions 
7.1.  Proceed 

proceed ( ) 

{ 

P  -  CP; 

COT  -  0; 

if  (value (CP)  0)  goalsuccess () ; 


The  test  for  CP  being  a  null  value  could  be  eliminated  if  CP  is  initialized  to  point  to  die  code  for  goal 
success. 

7.2 .  Execute 

execute  () 

{ 

P  «  argl; 

COT  «  0; 
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7-3.  Call 

call ( ) 

{ 

CP  -  P; 

P  ■  argl; 
N  «  arg2 ; 
CUT  =  0; 

} 


7.4.  Allocate 


allocate  ( ) 

{ 

if  (value (E)  <  value  (B) )  { 

tO  «  B; 


1 


}  else  { 

tO  •=  E  +  N; 


1 

memwrite (tO , 
memwrite (tO  +  1, 

memwrite (tO  +  2, 
memwrite (tO  +  3, 
E  «  to  +  4; 


E)  ; 

construct (0,  CUT,  B) ) ;  /*  record  the  CUT  bit  */ 

/*  as  the  cdr  bit  of  B  in  the  environment  */ 


CP)  ; 
N)  ; 


IS.  Deallocate 


deallocate ( ) 

{ 

N  m  memread (E  -  1)  ; 
CP  •  memread (E  -  2); 
E  •  memread (E  -  4); 


8.  Get  Instructions 
8.1.  Get  Variable 
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get_variable ( ) 

{ 

if  (ADDRESS  «=  Yi)  { 

menwrite(E  +  argl,  A[arg2]); 

}  else  { 

A  [argl]  «  A[arg2]; 

) 

} 


8.2.  Get  Value 

get_value ( ) 

( 

if  (ADDRESS  —  Yi)  { 

tO  =  memread (£  +  argl); 

)  else  { 

tO  «  A [argl] ; 

} 

tl  «  A [arg2] ; 

if  (tag(tO)  ««  VAR)  tO  «=  dereference (tO) 
if  (tag(tl)  —  VAR)  tl  •  dereference  (tl) 
if  (ADDRESS  !«  Yi)  A[argl]  *=  tl; 
unify (to,  tl) ; 


8.3.  Get  Constant 


get_constant  ( ) 

{ 

tO  =  A[arg2]; 

if  (tag(tO)  VAR)  tO  «  dereference (tO) 
unify (tO,  argl); 

} 


8.4.  Get  NU 

get_nil ( ) 

i 

to  ■  A[argl] ; 

if  (tag(tO)  VAR)  tO  ■  dereference (tO) 
unify (tO,  NIL); 

) 
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8.5.  Get  Structure 


get_structure () 

< 

tO  *  A [arg2] ; 

if  (tag(tO)  “  VAR)  tO  «  deref erence (tO) ; 
switch  (tag(tO))  { 
case  CON: 
case  LST: 
fail ( ) ; 
return; 
case  VAR: 


} 


MODE  ■=  WRITE; 

memwrite (tO,  construct ( STR,  cdr(tO),  H) ) ; 
trail (tO) ; 
memwrite (H,  argl) ; 

H++ 
break; 
case  STR: 

MODE  «  READ; 
tl  «  memread(tO); 

if  (tl  !■=  argl)  (  /*  32-bit  compare  works  because  cdr  bit  */ 

/*  is  always  clear  on  functor  */ 

fail  ()  ; 
return; 

) 

S  «  value (tO )  +  1; 
break; 


8.6.  Get  List 
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get_list  () 

< 

tO  «  A [argl] ; 

if  (tag(tO)  “  VAR)  to  ■=  dereference (tO) ; 

■witch  (tag(tO))  { 
case  CON : 
case  STR: 
failO  ; 
return; 
case  VAR: 

MODE  ■=  WRITE; 

memwrite (tO ,  construct (LST,  cdr (tO) ,  H) ) 
trail (tO) ; 
break; 
case  LST: 

MODE  «  READ; 

S  •=  value  (tO  ); 
break; 

) 

) 


9.  Put  Instructions 
9.1.  Put  Variable 

put_variable ( ) 

{ 

if  (ADDRESS  —  Yi)  { 

tO  *  construct (VAR,  0,  E  +  argl); 
memwrite (E  +  argl,  tO)  ; 

)  else  { 

tO  *  construct (VAR,  0,  H) )  ; 
memwrite (H,  tO) ; 

H++; 

A [argl]  -  tO; 

) 

A[arg2]  -  tO; 

) 


92.  Put  Value 
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put_value ( ) 

{ 

if  (ADDRESS  —  Yi)  { 

A[arg2]  •=  manure  ad  (E  +  argl); 

}  else  { 

A[arg2]  «  A [argl]; 

) 

} 


9.3.  Put  Unsafe  Value 

put_unsaf e_value ( ) 

{ 

to  «  memread(E  +  argl); 

if  (tag(tO)  VAR)  to  «  dereference (tO) ; 
if  (tag(tO)  !«  VAR  ||  value (tO)  <  value  (E) )  ( 

A[arg2]  «  tO; 
return; 

} 

tl  ■  construct (VAR,  0,  H)  ; 
memwrite(H,  tl)  ; 

H++; 

A[arg2]  ■■  tl; 
memwr ite ( tO ,  t 1 ) ; 
trail (tO) ; 

} 


9.4.  Put  Constant 

put_constant ( ) 

{ 

A[arg2]  ■  argl; 

) 


9.5.  Put  NU 

put_nil ( ) 

{ 

A[argl]  -  NIL; 

) 


9.6.  Put  Structure 
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put_structure ( ) 

{ 

MODE  -  WRITE; 

A[arg2]  *=  construct  (STR,  0,  H)  ; 
memwrite (H,  argl) ; 

H++; 

} 


9.7.  Put  List 

put_list  () 

{ 

MODE  •=  WRITE; 

A  [argl]  «=  construct  (LST,  0,  H)  ; 

} 


10.  Unify  Instructions 

It  is  very  important  that  before  calling  the  general  unify  routine  given  below,  that  both  arguments  are 
fully  dereferenced. 
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unify(tO,  tl) 

{ 

/*  tO  and  tl  must  be  dereferenced  on  entry  */ 


LI: 

if  (tag (tO )  !«  VAR  it  tag(tl)  !«  VAR)  {  /*  neither  is  a  VAR  */ 

if  (tag  (tO)  !-  tag(tl))  { 
fail()  ; 
return ; 

} 

if  (tag(tO)  «  CON)  ( 

if  (value (tO)  !-  value (tl) )  ( 

fail  (>  ; 
return; 


} 

if  (value (PDL)  “■=  0)  return; 
goto  L2; 

)  else  { 


t2  *  memread(tO); 
t3  “  memread(tl); 


t0++; 


tl++; 


PDL++; 


PDL1 [PDL]  -  tO; 

PDLr [PDL]  •  tl; 
tO  *  t2 ; 
tl  *  t3; 

if  (tag(tO)  «**  VAR)  tO  *  dereference (tO) ; 
if  (tag(tl)  •»“  VAR)  tl  *  dereference  (tl)  ; 
goto  LI; 

) 

)  else  if  (tag (tO)  —=  VAR  it  tag(tl)  —  VAR)  {  /*  both  are  VAR  */ 

if  (value (tO)  <  value (tl))  ( 

memwrite(tl,  construct (tag (tO) ,  cdr(tl),  tO) ) ; 
trail (tl ) ; 

}  else  { 

memwrite (tO,  construct (tag (tl) ,  cdr(tO),  tl)); 
trail (tO ) ; 

) 

)  else  {  /*  one  is  a  VAR  V 

if  (tag (tO)  —  VAR)  { 

memwrite (tO,  construct (tag (tl) ,  cdr(tO),  tl)); 
trail (tO ) ; 

)  «1»«  ( 

memwrite (tl,  construct (tag (tO) ,  cdr(tl),  tO)); 
trail (tl) ; 

} 

} 

if  (value (PDL)  0)  return; 

L2:  /*  PDL  is  not  empty  */ 

tO  -  PDL1 [PDL] ; 
tl  «  PDLr [PDL]; 

PDL— ; 

t2  “  memread(tO); 
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t3  •=  memread (tl) ; 
if  (cdr(t2)  «=  1)  { 

if  (tag(t2)  “  VAR)  t2  »=  dereference (t2) ; 
tO  “  t2; 

) 

if  (cdr(t3)  —  1)  { 

if  (tag(t3)  «  VAR)  t3  ■=  dereference  (t3 ) ; 
tl  «  t3; 

) 

goto  LI; 

} 


10.1.  Unify  Void 


unify_void ( ) 

{ 

count  argl; 
if  (MODE  «=  READ)  { 

Lis 


if  (count  “  0)  return; 
tO  «  memread (S); 
if  (cdr(tO)  ««  1)  { 

if  (tag(tO)  «  VAR)  tO  *  dereference  (tO) ; 
switch  (tag(tO))  { 
case  CON: 
case  STR: 
fail  ()  ; 
return; 
case  VAR: 

MODE  -  WRITE; 

memwrite(tO,  construct (LST,  cdr(tO),  H) ) 
trail (tO ) ; 
goto  L2 ; 
case  LST: 

S  ■  value (tO) ; 

} 

} 

count--; 

S++; 

goto  LI; 

}  «1»«  { 

L2 : 

if  (count  ««  0)  return; 

»emwrite(H,  constsruct  (VAR,  0,  H) )  ; 
count--; 

H++; 

goto  L2; 

) 

) 
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10.2.  Unify  Value 

unify_value () 

{ 

if  (ADDRESS  -=  Yi)  { 

tO  c  memread(E  +  argl); 

}  else  { 

tO  «*  A  [argl]  ; 

> 

if  (tag(tO)  ■*»=  VAR)  tO  ■=  dereference  (tO)  ; 
if  (MODE  —  READ)  { 

tl  «  memread(S); 
if  (cdr(tl)  -* =  1)  { 

if  (tag(tl)  “  VAR)  tl  “  dereference (tl) ; 
switch  (tag(tl))  { 
case  CON: 
case  STR: 
fail()  ; 
return; 
case  VAR: 

MODE  »  WRITE; 

memwrite (tl,  construct (LST,  cdr(tl),  H) ) ; 
trail (tl) ; 
goto  LI ; 
case  LST: 

S  -  tl; 

tl  “  memread (S) ; 

) 

} 

S++ ; 

if  (tag(tl)  «  VAR)  tl  *  dereference (tl ) ; 

unify (tO,  tl) ; 

return; 

}  else  { 

LI: 

if  (tag(tO)  VAR  it  value  (tO)  >  value (H) )  { 
tl  «  construct (VAR,  0,  H) ; 
memwrite (H,  tl) ; 

memwrite (tO,  tl);  /*  clearing  cdr  is  OK — on  stack  */ 

trail (tO) ; 

}  else  { 

memwrite (H,  construct (tag (tO) ,  0,  tO) )  ; 

/*  make  sure  cdr  bit  is  clear!  */ 

} 

H++; 

} 

} 


The  PLM  compiler  produces  the  mstructiorL  unify_unsafe  value.  For  executioo  by  the 
VLSI-PLM,  this  instructioD  should  be  replaced  with  uni f  y_value  (die  assembler  could  do  this). 

In  write  mode,  if  the  argument  of  unify_value  dereferences  to  an  unbound  variable  on  the 
stack,  then  it  would  be  wrong  for  this  to  be  simply  copied  to  the  top  of  the  heap  (since  a  variable  link  would 
then  point  upwards).  Instead,  a  new  unbound  variable  is  created  on  the  heap  and  is  bound  to  the  variable 
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on  the  stack,  the  binding  is  trailed  if  necessary.  Thus,  our  unify_value  is  Wanen’s 
unify_local_value . 

There  is  no  version  of  unify_value  without  the  check  for  an  unbound  variable  on  the  stack 
because  one  can  never  tell  at  compile  time  whether  the  check  can  be  safely  eliminated.  The  following  code 
illustrates  this  point 


main  a (X) ,  d(X),  e,  write (X) ,  nl. 
a  (Y)  c  (X,  Y)  ,  b  (X)  . 
b(_)  . 

c(X,  [X]).  %  called  from  a/1  with  X  set  to  unbnd  var  on  stack 

d  ( [x]  )  . 

e  b(X) ,  b (X) . 


10J.  Unify  Variable 
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unify_variable ( ) 

{ 


if  (MODE  ■=  READ)  ( 

tO  *=  memread(S)  ; 

if  (cdr(tO)  «=  1)  {  /*  S  points  to  cdred  data  */ 

if  (tag(tO)  ■*=  VAR)  tO  *  dereference (tO) ; 
switch  (tag(tO))  { 
case  CON: 
case  STR: 
fail <) ; 
return; 
case  VAR: 

MODE  -  WRITE; 

memwrite (tO,  construct (LST,  cdr(tO),  H) ) ; 
trail (tO) ; 
goto  LI ; 
case  LST: 

S  =  value  (tO) ; 
tO  •  memread(S) ; 

} 

} 

S++ ; 

if  (tag(tO)  mm  VAR)  tO  ■  dereference  (tO) ; 
if  (ADDRESS  «■=  Yi) 

memwrite (E  4  argl,  tO) ; 

else 

A [argl]  “  tO; 


LI: 


}  else  { 


/*  MODE 


WRITE  */ 


tO  «  construct (VAR,  0,  H) ; 
memwrite (H,  tO) ; 

H44  ; 

if  (ADDRESS  —  Yi) 

memwrite (E  4  argl,  tO) ; 

else 

A [argl]  -  tO; 


/*  push  unbound  var  on  heap  */ 


10.4.  Unify  Constant 
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unify_constant ( ) 

{ 

if  (MODE  «*=  READ)  { 

tO  *=  memread  (S)  ; 
if  (cdr(tO)  — =  1)  ( 

if  (tag(tO)  —=  VAR)  tO  «  dereference  (tO) ; 
switch  (tag(tO))  { 
case  CON: 
case  STR: 
fail  ( ) ; 
return; 
case  VAR: 

MODE  -  WRITE; 

memwrite(tO,  construct (LST,  cdr(tO),  H) ) 
trail (tO) ; 
goto  LI; 
case  LST: 

S  =  value (tO) ; 
tO  =  memread(S); 

} 

} 

S++; 

if  (tag(tO)  «  VAR)  tO  «  dereference (tO ) ; 

unify (tO,  argl) ; 

return; 

}  else  (  /*  MODE  ->«  WRITE  «/ 

LI: 

memwrite (H,  argl); 

H++; 

} 

} 


10.5.  Unify  Cdr 
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unify_cdr {) 

{ 

if  (MODE  —  READ)  ( 

tO  *=  memread(S); 

if  (cdr(tO)  “  0)  tO  *  construct (LST,  0,  S) 

)  else  { 

tO  *  construct (VAR,  1,  H)  ; 
memwrite (H,  tO)  ; 

H++; 

} 

if  (ADDRESS  — =  Yi)  { 

memwrite (E  +  argl,  tO); 

}  else  { 

A [argl]  «  tO; 

} 

) 


10.6.  Unify  Nil 

unify_nil () 

1 

if  (MODE  —  READ)  ( 

tO  «=  memread(S); 
if  (cdr(tO)  •*  0)  ( 

fail() 
return ; 

} 

if  (tag(tO)  mm  VAR)  tO  m  dereference  (tO) ; 

unify (tO,  NIL); 

return; 

)  else  { 

memwrite(H,  construct (CON,  1,  NIL)); 

H++  ; 

} 

} 


11.  VLSI-PLM  Specific  Instructions 
11.1.  Deref 


deref () 

{ 

to  ■  A [argl]; 

if  (tag(tO)  mm  VAR)  tO  •  dereference (tO) ; 
A [argl]  «  tO; 

) 
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11.2.  Add 

add() 

{ 

tO  *  A [argl] ; 
tl  «  A [arg2] ; 

A[arg2]  •=  construct  (CON,  0,  tO  +  tl)  ; 

} 


1L3.  Sub 

sub  () 

{ 

tO  ■  A [ argl ] ; 
tl  -  A[arg2] ; 

A[arg2)  «=  construct  (CON,  0,  tl  -  tO)  ; 

) 


11.4.  Mult 

Multiply  does  an  unsigned  multiply  of  two  27-bit  integers  to  produce  a  27-bit  result.  If  the  result 
requires  more  than  27-bits,  then  an  overflow  is  indicated  by  setting  the  answer  to  the  constant  NIL.  To 
count  the  number  of  iterations,  one  is  shifted  left  27  times. 

mult ( ) 

{ 

multiplier  “  A [argl]; 
multiplicand  «  A[arg2]; 
accum  =0; 
cnt  ■=  1; 

LI: 

accum  •  accum  «  1; 

if  (accum  t  0x08000000  «  1)  goto  ovrflw; 
multiplier  «  multiplier  «  1; 
if  (multiplier  t  0x08000000  1)  { 

accum  “  accum  +  multiplicand; 

if  (accum  £  0x08000000  1)  goto  ovrflw; 

1 

cnt  “  cnt  «  1; 

if  (cnt  i  0x08000000  0)  goto  LI; 

A(arg2]  •  construct (CON,  0,  accum); 
return; 

ovrflw: 

A[arg2J  ■  NIL; 
return; 

} 
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11.5.  And 


»nd() 

{ 

tO  *  A [argl] ; 
tl  -  A[arg2]; 
A[arg2]  •  to  (  tl; 


/*  32-bit  operation  ! !  */ 


11.6.  Or 


or  () 

{ 

tO  «=  A  [argl]  ; 
tl  «=  A  [arg2]  ; 

A [arg2]  -  tO  |  tl; 


/*  32-bit  operation  ! !  */ 


11.7.  Eor 


eor  () 

< 

to  *  A [argl] ; 
tl  *  A [arg2] ; 

A [arg2 ]  -  to  '  tl; 


/*  32-bit  operation  ! !  */ 


11.8.  Memread 

memread ( ) 

A[arg2]  »  memread (A [argl]  +  arg3) ; 

) 


11.9.  Mem  write 

memwrite () 

memwrite (A [argl]  4  arg3,  A[arg2]); 

) 
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11.10.  Coderead 

coderead () 

< 

A[7]  «  coderead (A [argl] )  ; 

} 


11.11.  Codewrite 

codewrite { ) 

{ 

codewrite (A [argl] ,  A[7]); 

1 


11.12.  Jump 

Same  as  execute,  except  that  the  CUT  bit  is  not  affected. 


jump  () 

{ 

P  ■  argl; 

) 


11.13.  Jit 

jit  () 

1 

if  (value (A [argl] )  <  value (A[arg2] ) )  P  «  P  +  arg3; 

} 


11.14.  Jeq 

j«q() 

{ 

if  (A[argl]  —  A[arg2])  P  -  P  +  arg3;  /*  32-bit  equality!!  */ 

) 


11.15.  Jle 

jle() 

i 

if  (value (A [argl] )  <«  value (A[arg2] ) )  P  «  P  +  arg3; 

} 
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11.16.  JumpXn 

jumpxn ( ) 

1 

P  -  A[argl] ; 

} 


11.17.  LoadN 


loadn ( ) 

{ 

N  «  argl; 

} 


12.  Test  Programs 

The  following  test  programs  at  one  time  were  not  correctly  executed  due  to  microcode  bugs.  They 
all  now  work  correctly  on  the  VLSI-PLM  (however,  some  versions  of  the  PLM  level  1,  PLM  leve!2,  and 
PPP  level  1  simulators  have  not  been  corrected).  The  programs  are  given  here  as  test  cases  for  new  simula¬ 
tors  and  PLM  implementations. 

12.1.  Dereferencing 

Check  dereferencing  of  arguments  in  general  unify: 

main  a (X) ,  a (Y) ,  b(X,Y),  c(X),  d(X,Y),  write (Y) ,  nl . 

a([S,aJ).  %  create  a  list  on  the  heap  with  a  variable  in  it 

b  ( [A|_]  ,  [A|_]  )  .  %  bind  the  two  variables  together 

c([al_]).  %  bind  a  constant  to  the  variables 

%  at  this  point  one  list  has  a  constant  as  its  first  element 

%  and  the  other  has  a  variable  bound  to  a  constant 

d(A,A).  %  this  unification  should  succeed 

%  fails  here  if  variables  not  deref'd  in  unification 


12.2.  Cdr  Bits 

Check  that  NIL,  when  used  as  a  constant,  does  not  have  its  cdr  bit  set: 


main  *([[], []  ]  , [A,B] ) ,  writ* (A) ,  nl,  writ* (B) ,  nl. 
»(X,X)  . 


Another  check  on  the  constant  NIL: 
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main  write  ( [x,  []])  ,  nl. 


Test  that  cdr  bit  stays  the  same  during  variable  binding: 

main  A  «  [x|X],  B  «  [x,Y],  X  «  Y,  Y  «  a,  write (x (A, B) ) ,  nl. 


Test  cdr  links: 


main  a ([A] ) ,  X  -  [alY],  Y  -  [b|A],  b (X) ,  write (X) ,  nl,  fail. 

main  a { [A] ) ,  X  -  [a|Y],  Y  -  [b|A],  A  ■  [c] ,  b(X),  write (X) ,  nl,  fail. 

a  (_)  . 

b  (  [  A,  B,  C]  )  a  (  [A,  B,  C]  )  . 


Another  test  of  cdr  links: 

main  a  (  [X]  )  ,  A  «  [a|Y],  Y  -  [b  |  X]  ,  X  -  [c],  b  (A)  ,  write  (A)  ,  nl,  fail, 
main  :-a([X]),  A-  [a|Y],  Y  -  fb | X] ,  b (A) ,  write (A) . 
a  (_)  . 

b ( [X, Y, 2] )  a (X) ,  a (Y) ,  a(2). 


Check  that  the  cdr  bit  of  an  unbound  variable  is  preserved  by  get_st  ructure  : 


main  write ( [ a  I b (x) ] ) ,  nl . 


Check  decdring  in  general  unify: 


%  from  Chien  Chen 

main  a (A) ,  b(B),  c (A,  B) ,  d (A) . 
a ( [a|X] ) . 
b ( [a  I  foo (b, c) ] ) . 
c (X,  X). 

d<[  IX])  c (X,  foo (b, c) ) . 


fwv  that  the  cdr  bit  is  cleared  before  writing  to  the  heap  in  unif  y_value  (write  mode): 

%  from  Barry  Fagin 

main  a ([HIT]),  X  -  Y-T,  foo(X),  write (X) . 

a  (_) . 
f o© (X-X)  . 


Test  unify_void : 
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main  y(A),  write (a), 

z (A, Z) ,  write  (b) , 
a ( [X, Y I Z]  )  ,  write (c)  , 
z (M, L) ,  write  (d)  , 
z  (M,  [a]  )  ,  write  (e)  , 
a ( [a  I L]  )  . 
a  J)  . 

y  (A)  . 
z  (A,  A)  . 


12  .3.  Unsafe  Variables 

Test  comparison  for  current  environment  in  put_unsaf e_value  : 

main  a (X) ,  a(Y),  b(X,Y),  c(X,Y).  %  this  just  causes  to  put_unsafe's 

%  but  one  does  not  transfer  pointer  to  heap 

a  (X)  . 
b  (X,  X)  . 

c(X,Y)  d(X,Y),  e.  %  'e'  is  just  to  force  an  allocate 

d(X,Y)  a (X) ,  a (Y) ,  f (X, Y) ,  e.  %  another  allocate  that  destroys 

%  pointer  for  X  and  Y 
e . 

f  (a, a)  .  %  this  should  succeed 

Test  put_unsaf  e_value  when  two  variable  are  bound  together  in  the  current  environment  (the 
end  of  the  variable  chain  must  be  changed  to  point  to  a  newly  created  unbound  variable  on  the  heap): 

%  from  Jeff  Gee: 
n  a  (X,  Y)  ,  b  (Y)  ,  write  (X)  ,  nl . 

a  (V,  V)  . 
b ( joe) . 

Check  that  unify_value  does  unsafe  variable  globalization  in  write  mode: 

main  a(X),  d(X),  a,  write (X) ,  nl. 
a (Y)  c (X, Y) ,  b(X) . 

b<_>  . 

c(X,  [X]).  %  called  from  a/1  with  X  set  to  unbnd  var  on  stack 

d([x])  . 

e  b  (X) ,  b  (X)  . 

Test  that  the  overwritten  variable  on  the  stack  in  unif  y_value  (write  mode)  is  trailed. 
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main  a(X,Y),  b(2),  c (X, Y) ,  d(2). 

a  (X,X)  . 

b(H)  . 
b  ( [a,  b,  c] )  . 

c (X,  Y)  var  (X) ,  ! . 

c (X, Y)  write (' *»*  BDG  ***'),  nl . 

d  ( I _ I _ 3  )  • 

12.4.  De  trailing 

Test  that  multiple  setargs  (to  tbe  same  location)  are  untrailed  properly: 

main  X  *  a (a),  b (X) ,  fail. 

b (X)  write (X) ,  nl, 

setarg(l,  X,  b) ,  write (X) ,  nl, 
setargd,  X,  c)  ,  write  (X)  ,  nl . 

b (X)  write (X) ,  nl. 


13.  Suggestions  for  Future  Instruction  Sets 

13.1.  Eliminate  Cdr  coding 

Cdr  coding  has  been  shown  not  to  yield  any  performance  advantage  [ToDe],  and  it  complicates  the 
microcode.  Many  of  the  last  bugs  to  be  removed  from  the  VLSI-PLM  microcode  where  related  to  cdr  cod¬ 
ing. 

13.2.  Eliminate  Unsafe  Variables 

Unsafe  variables  have  also  been  tbe  source  of  several  bugs.  By  changing  put_variable  Yi 
to  create  an  unbound  variable  on  tbe  heap  and  a  pointer  to  it  in  tbe  environment,  unsafe  variables  are  elim¬ 
inated  Tbe  benefits  include  elimination  of  tbe  put_unsafe_value  instruction,  simplification  of  tbe 
trail  routine  (one  comparison  rather  than  three),  and  simplification  of  unify_value  in  write  mode.  Tbe 
drawbacks  include  an  extra  dereference  link  and  the  creation  of  more  garbage  on  tbe  heap.  Tbe  highly 
recursive  Takeuchi  function  (see  Gabriel ’s  lisp  benchmarks)  is  an  example  of  tbe  second  drawback: 
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main  tak (18 , 12 , 6 , A) ,  write (A) ,  nl . 

tak (X, Y,  Z , A)  : - 

X  -<  Y,  ! , 

Z  -  A. 

tak  (X,  Y,  Z ,  A)  :  - 

XI  is  X-l, 

tak (XI,  Y,  Z,A1)  , 

Y1  is  Y-l, 
tak (Yl, Z, X,  A2) , 

Z1  is  Z-l, 
tak (Z1,X,  Y,  A3> , 
tak (A1 , A2 , A3 ,  A)  . 

In  the  VLSI-PLM  the  variables  Al,  A2,  and  A3  are  allocated  on  the  stack,  and  the  heap  is  never  used. 
However,  if  the  these  variables  are  allocated  on  the  heap,  47,706  words  are  required. 
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